SYSTEM WAITS FOR A PIP 
AUDIO INDICATION TO BE 
PROVIDED BY THE USER 
SYSTEM PROVIDES 
AN INDICATION 
THAT THE GESTURE 
WAS NOT 
IDENTIFIED 



SYSTEM RECOGNIZES PIP 
AUDIO INDICATION 



SYSTEM ACTIVATES IMAGE 
ACQUISITION 
SYSTEM IDENTIFIES A USER 
GESTURE FROM THE ACQUIRED 
IMAGE 
SYSTEM DETERMINES THE USER 
REQUESTED PIP MANIPULATION 
SYSTEM PERFORMS THE 
REQUESTED PIP MANIPULATION 



FIG. 2 
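
The FIG. 2 flow can be sketched in code as a single control cycle. This is only an illustrative sketch: the function name `run_pip_control_cycle`, its callable parameters, and the gesture-to-manipulation mapping are all hypothetical and are not taken from the drawing, which specifies only the sequence of steps.

```python
from typing import Callable, Dict, Optional


def run_pip_control_cycle(
    wait_for_audio: Callable[[], str],
    is_pip_indication: Callable[[str], bool],
    acquire_image: Callable[[], object],
    identify_gesture: Callable[[object], Optional[str]],
    gesture_to_manipulation: Dict[str, str],
    perform: Callable[[str], None],
    notify_not_identified: Callable[[], None],
) -> Optional[str]:
    """Run one pass of the FIG. 2 flow (all parameter names are hypothetical).

    Returns the manipulation that was performed, or None if nothing was done.
    """
    # System waits for a PIP audio indication to be provided by the user.
    audio = wait_for_audio()

    # System recognizes the PIP audio indication (or ignores other audio).
    if not is_pip_indication(audio):
        return None

    # System activates image acquisition.
    image = acquire_image()

    # System identifies a user gesture from the acquired image.
    gesture = identify_gesture(image)

    # System determines the user-requested PIP manipulation.
    manipulation = gesture_to_manipulation.get(gesture) if gesture else None
    if manipulation is None:
        # System provides an indication that the gesture was not identified.
        notify_not_identified()
        return None

    # System performs the requested PIP manipulation.
    perform(manipulation)
    return manipulation
```

Passing the audio, image, and gesture stages in as callables keeps the sketch independent of any particular microphone, camera, or recognizer, none of which the figure names.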
SYSTEM ELICITS AND CAPTURES 
ONE OR MORE INPUT SAMPLES FOR 
EXPECTED AUDIO INDICATION OR 
GESTURE 
SYSTEM ASSOCIATES ONE OR 
MORE CAPTURED INPUT SAMPLES 
FOR AN EXPECTED AUDIO 
INDICATION OR GESTURE 
ONE OR MORE LABELED INPUT 
SAMPLES ARE PROVIDED TO A 
CLASSIFIER TO DERIVE MODELS 
FIG. 3 
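
The FIG. 3 training flow can be illustrated with a small sketch. The drawing does not say which classifier is used, so a nearest-centroid classifier is swapped in here purely as an assumption. The names `label_samples`, `derive_models`, and `classify`, and the representation of each sample as a list of numeric features, are likewise hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

Sample = Sequence[float]  # assumed: each captured sample is a numeric feature vector


def label_samples(captured: Dict[str, List[Sample]]) -> List[Tuple[str, Sample]]:
    """Associate each captured sample with the indication or gesture it was elicited for."""
    return [(label, sample) for label, samples in captured.items() for sample in samples]


def derive_models(labeled: List[Tuple[str, Sample]]) -> Dict[str, List[float]]:
    """Derive one model per label; here a model is simply the centroid of its samples."""
    grouped: Dict[str, List[Sample]] = defaultdict(list)
    for label, sample in labeled:
        grouped[label].append(sample)
    return {
        label: [sum(column) / len(column) for column in zip(*samples)]
        for label, samples in grouped.items()
    }


def classify(models: Dict[str, List[float]], sample: Sample) -> str:
    """Return the label whose centroid is closest (squared Euclidean distance)."""
    return min(
        models,
        key=lambda label: sum((m - x) ** 2 for m, x in zip(models[label], sample)),
    )


# Hypothetical elicited samples for two expected gestures.
captured = {
    "swipe_left": [[0.0, 1.0], [0.2, 0.8]],
    "swipe_right": [[1.0, 0.0], [0.8, 0.2]],
}
models = derive_models(label_samples(captured))
predicted = classify(models, [0.1, 0.9])
```

The derived models could then back the gesture or audio recognition step of the FIG. 2 flow, although the figures themselves do not state how the two are connected.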



